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Abstract. We study the trajectories of a solution Xt to an Ito stochastic differential equation 
in R d , as the process passes between two disjoint open sets, A and B. These segments of the 
trajectory are called transition paths or reactive trajectories, and they are of interest in the 
study of chemical reactions and thermally activated processes. In that context, the sets A 
and B represent reactant and product states. Our main results describe the probability law of 
these transition paths in terms of a transition path process Yt, which is a strong solution to an 
auxiliary SDE having a singular drift term. We also show that statistics of the transition path 
process may be recovered by empirical sampling of the original process X t . As an application of 
these ideas, we prove various representation formulas for statistics of the transition paths. We 
also identify the density and current of transition paths. Our results fit into the framework of 
the transition path theory by E and Vanden-Eijnden. 

1. Introduction 

In this article we study solutions Xt € M. d of the Ito stochastic differential equation 

(1.1) dX t = b(X t )dt + V2a(X t )dW t , 

where [Wt,Jt) is a standard Brownian motion in R rf , defined on a probability space (0, J~, P). 
This diffusion process in M. d has generator 

Lu = tr(aV 2 u) + b • Vu, 

where a := aa T is a symmetric matrix. We suppose that a(x) is smooth, uniformly positive 
definite, and bounded: 

A|£| 2 < (£,a(x)f> < A|e| 2 , VfeM d , V x e R d 

holds for some A > A > 0. We suppose the vector field b(x) is smooth and satisfies conditions 
that guarantee the ergodicity of the Markov process Xt and the existence of a unique invariant 
probability distribution p(x) > satisfying the adjoint equation 

(1.2) L*p = ( aij (x)p{x)) XiXj - V • (b(x)p(x)) = 0. 
For example, this will be the case if 

limsup sup x ■ b{x) < —r 

m^r+oo \ x \=m 

with r > 1 + (d/2) |Ver97j . 

Suppose that A, B C R d are two bounded open sets with smooth boundary and such that A 
and B are disjoint. Because the process is ergodic, Xt will visit both A and B infinitely often. 
Inspired by the transition path theory developed by E and Vanden-Eijnden [EVE06 , MS VE06 
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(see also the review article [EVElOj ). our main interest is in those segments of the trajectory 
t \— f Xt which pass from A to B. These transition paths and are defined precisely as follows. 
First, for k > 0, define the hitting times r Ak and k inductively by 

ia£{t >0\X t eA}, 



'AO 



and for k > 0, 



r+ = inf{t > r+ | X t S B}, 



T 



B,k+1 



M{t > r+ | X t e B}. 



We will call these the entrance times. Then define the exit times 

su V {t <r+ k \X t e A}, 



'A,k 

T B,k = s Mt < rXk+i \x t eB}. 
These times are all finite with probability one, and r Ak < r A ~ k < k < k < T Ak+1 for 
all k > (see Figure [Q. If t E [ T Ak> T Bk\ ^ or some k, we say that the path X t is A — > B 
reactive. Let = (A U B) c , and hence d@ = dA U dB. For k E N, the continuous process 
Y k : [0, oo) -> G defined by 



(1.3) 



Y* = X { 



is the k A — > B reactive trajectory or transition path. Observe that Y = X T - E dA, 
and that Y t k = X + E dB for all i > r^ k - r Ak , and that Y t k E 9 for all iG(0,r^- T Ak ). 

B ,k ' ' ' ' 



A 



A.l 



B 



T B,0 




A \ 









'B,l 

Figure 1. Illustration of a trajectory with entrance and exit times. The transi- 
tion path from A to B is marked in red. 

Our main results describe the probability law of these transition paths in terms of a transi- 
tion path process, which is a strong solution to an auxiliary stochastic differential equation. 
In particular, empirical samples of the reactive portions of Xt may be regarded as sampling 
from the transition path process. The motivation comes from the study of chemical reactions 
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and thermally activated processes where understanding these reactive trajectories are crucial 
[DBG02,BCDG02j. In these applications, the domains A and B are usually chosen as regions in 
configurational space corresponding to reactant and product states. Mathematically, our results 
fit into the framework of the transition path theory |EVE101lEVE06llMSVE06] . 

Having identified the transition path process, we can compute statistics of the transition 
paths by sampling directly from the transition path SDE, rather than using acceptance/rejection 
methods or very long-time integration on the original SDE. Of course, this assumes knowledge 
of the committor function, which is non-trivial. In any case, our results might be used to analyze 
methods of sampling reactive trajectories. 

We will now describe our main results and their relation to other works. Proofs are deferred 
to later sections. 

1.1. The transition path process. Our definition of the transition path process is motivated 
by the Doob /i-transform as follows. Let ta and tb denote the first hitting time of Xt to the 
sets A and B, respectively: 

T A = inf {t > I X t G A] , 

( L4 ) 

tb = mf {t > | X t G B} . 
Let q{x) > be the forward committor function: 

(1.5) q(x) = F(t a >t b \X = x), 



which satisfies Lq(x) = for i£9 = (AU B) and 
(1.6) q(x) = 




By the maximum principle, q{x) > for all x G O. By the Hopf lemma we also have 

(1.7) sup n(x) ■ Vq(x) < 0, inf n{x) ■ Vq(x) > 0, 

xedA X ^9B 

where n(x) will denote the unit normal exterior to O (pointing into A and B). For x G O, 
consider the stopped process X tArAArs with Xq = x, and let V x denote the corresponding 
measure on X = C([0, oo)): 

V X {U) =F(X G U | X = x), V[/G6 

where B is the Borel cr-algebra on X. If Aab denotes the event that ta > Tb, the measure Q% 
on (X,B) defined by 

dQl = Ia ab = Ia ab 

dV x V X {A AB ) q(x) 
is absolutely continuous with respect to V x , if x G O. By the Doob /i-transform (see e.g. |Day92| , 
[Pin951 Theorem 7.2.2]), we know that Q% defines a diffusion process Y t on C([0,oo)) with 
generator: 

(1.8) LV = -L(qf) = tr(aV 2 /) + (*> ■ V/) + ^ ■ V/ = Lf + ^ • V/. 

q q q 

So, the effect of conditioning on the event tb < ta is to introduce an additional drift term. 
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This observation suggests that the A^t B reactive trajectories should have the same law as 
a solution to the SDE 

(1.9) dY t = (b(Y t ) + ggreggKh dt + V2a(Y t ) dW t , 

v q{Yt) ) 

originating at a point Yq = uq G cM and terminating at a point in 95. While the SDE (jl.9p 
admits strong solutions for yo £ Q since g(x) > in 0, the drift term becomes singular at the 
boundary of A, where q vanishes. Our first result is the following theorem which shows that 
there is still a unique strong solution to this SDE even for initial condition lying in dA. For 
convenience, let us define the vector field 

(i-io) Jf W = (w + 25 ^ M )- 

Theorem 1.1. Let (W,T^) be a standard Brownian motion in M. d , defined on a probability 
space (Q,T,Q). Let £ : £1 — > be a random variable defined on the same probability space and 
independent of W . There is a unique, continuous process Y t : [0, oo) — > which is adapted to 
the augmented filtration Tt and satisfying the following, Q-almost surely: 

rtAr B ptAT B 

(1.11) Y t = i+ I K(Y s )ds + V2a(Y s )dW s , t>0 

Jo Jo 

where 

t b = inf{t >0\Y t £B}. 

Moreover, Yt A for all t > 0. 

The augmented filtration is defined in the usual way, Tt being the a-algebra generated by 
, Yq, and the appropriate collection of null sets so that Tt is both left- and right- continuous. 
We will use E to denote expectation with respect to the probability measure Q. 

Observe that if d = 1, a = l/y/2 is constant, and 6=0, then q(x) is a linear function, and 
(jl.9p corresponds to a Bessel process of dimension 3. For example, if A = (— oo, 0), B = (1, oo), 
we have 

dY t = ±-dt+ dW t , 
and the function Z t = (Y t ) 2 satisfies the degenerate diffusion equation 

(1.12) dZt = 3dt + 2y r Z t dWt. 

In this simple case, existence and uniqueness of a strong solution starting at Yq = can be shown 
using arguments involving Brownian local time (sec [RY99, KS91J). However, those arguments 
are not applicable to the more general setting we consider here. The work most closely related 
to Theorem 11.11 in a higher dimensional setting may be that of DeBlaissie [DeB04| who proved 
pathwise uniqueness for certain SDEs having diffusion coefficients that degenerate like \J d(Zt) 
where d{z) is the distance to the domain boundary (as in (|1.12p ). In an earlier work, Athreya, 
Barlow, Bass, and Perkins [ABBP02J proved uniqueness for the martingale problem associated 
with a similarly degenerate diffusion in a positive orthant in R . Nevertheless, those analyses 
do not apply to the case (|1.9p considered here. 

Our next result is the following theorem which shows that the law of the reactive trajectories 
is that of the process Yt with appropriate initial condition. For this reason, we will call the 
process Yt the transition path process. 
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Theorem 1.2. Let X t satisfy the SDE il.l}) . Let Y k denote the k th A — >■ B reactive trajectory 
defined by (|1.3p . Let Y be defined as in Theorem Then for any bounded and continuous 
functional F : C([0, oo)) —> R, we have 

E\F(Y k )} = 1 \f(Y) I Y ~ X - 

L T A,J 

The processes X t and are defined on a probability space that is different from the one on 
which Yt is defined. The notation Yq ~ X - used in Theorem 11.21 means that Yq has the same 

T A,k 

law as X r - , meaning Q(Y~ S U) = W(X T - G U) for any Borel set U C R d . 

1.2. Reactive exit and entrance distributions. The distribution of the random points X - 

_ T A,k 

will depend in the initial condition Xq. Prom the point of view of sampling the transition paths, 
however, there is a very natural distribution to consider for Yq. To motivate this distribution 
formally, let h > and consider the regularized hitting times 

(1.13) TA,h = 'mf{t>h\X t e A} 

(1.14) T B>h = inf {t > h | X t G B) , 
where X t satisfies (jl.ip . Then define 

?/i(aO = P(T Ai h > r B ,h \ X = x). 

This is the probability that at some time s G [0, h], the path Xt starting from x G dA becomes 
a transition path, not returning to A before hitting B. With this in mind, the quantity 

i]A,h( x ) = h~ 1 p(x)F(T A ,h > r B ,h \X = x) = h~ 1 p(x)q h {x), 

may be interpreted as a rate at which transition paths exit A, when the system is in equilibrium. 
Therefore, a natural choice for an initial distribution for Yq G dA is: 

rj A (x) = lim rj A ,h- 
h— >0 

By the Markov property, we have 

(1.15) q h (x) = [ P(r A >r B \X = y)p(h, x, y) dy = E[q(X h ) \ X Q = x] 

where p(t, x, •) is the density for Xt, given Xq = x. Therefore, for any x G OA we have 
lim h~ l q h (x) = lim hr X E[q(X h ) - q(X ) \ X = x] = Lq(x), 

in the sense of distributions, although q is not C 2 on <90 = dA U dB. Hence r]A : h{x) — > t]a( x ) = 
p(x)Lq(x) for x G dA. The distribution Lq is supported on d@. If <j> is a smooth test function 
supported on a set B r (x), a small neighborhood of x G dA, then we have 



(Lq,4>) = / q(x)L*(f)(x) dx 

= / Lq{x)<p{x) dx + I (qn ■ dw (a<$) — (fi ■ aV q)(f> + qn ■ b<fi) da a{x) 

JB r (x)ne J(dA)(lB r (x) 

where n(x) is the unit normal vector exterior to 0, and da a is the surface measure on dA. Since 
q = on dA and Lq = on 0, this implies, 



(Lq,(j)) = — / 0n • aVqdaA(x) 

J(dA)nB r (x) 
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That is (after a similar calculation for points on dB), 

(1.16) Lq(x) = —n(x) ■ a(x)Vq(x) daA(x) — n(x) ■ a(x)Vq(x) d<Ts(x), 
in the sense of distributions. Restricting on dA, we get 

(1.17) t]a = —p(x)n(x) • a(x)Vq(x) dovi(x). 

By switching the role of A and B in the above discussion, it is also natural to define a measure 
on dB as 

(1.18) t]b = p(x)n{x) ■ a(x)Vq(x) das{x). 

Note that 1 — q gives the forward committor function for the transition from B to A and that 
Lq(x) = r]A(dx) — risidx). Although the distributions t]a and tjb are positive (by (|1-7|) ). they 
need not be probability distributions. Nevertheless, the mass of the two measures is the same. 

Lemma 1.3. The measures tja and tjb satisfy r/A(dA) = r]B(dB). That is, 



(1.19) / p(x)n{x) ■ a{x)V q{x) da a{x) + I p{x)n(x) ■ a{x)Vq{x) dasix) = 0. 
JdA JdB 

This computation motivates us to define 

(1.20) ?]7(da;) = —rjA(dx) = p(x)n(x) ■ a(x)V qix) da a(x) ■, 

v v 

(1.21) 77^(dx) = — risidx) = —p(x)n(x) ■ a(x)Vq(x) daB(x), 

We call these distributions the reactive exit distribution on dA and on dB, respectively. The 
constant v is a normalizing constant so that rf^ and define probability measures on dA and 
dB. By Lemma 11.31 the normalizing constant is the same for both measures. Our next result 
relates the reactive exit distribution on dA to the empirical reactive exit distribution on 
dA, defined by 

1 N ^ 

(1-22) »In=kY, S X t - 

k=0 A ' k 

Proposition 1.4. Let Pan ^ e the empirical reactive exit distribution on dA defined by (|l,22p . 
Then Pan converges weakly to n~ A as N — » oo. That is, for any continuous and bounded 
f : dA -V R 



lim 

N^-oo 



holds F-almost surely. 



dA 



f{x)dp AN {x) = / f{x)dn A {x) 
JdA 



A similar statement holds for the reactive exit distribution on dB and the empirical distri- 
bution of the points X T - . The reactive exit distribution n~^{dx) is related to the equilibrium 

measure eA,B(dx) in the potential theory for diffusion processes |Szn98, BEGK04, BGK05J. In 
fact, the committor function q is known as the equilibrium potential in those works, and the 
equilibrium measure eA,B(dx) is given by Lq restricted on dA. Specifically, we have 

(1.23) VA.(dx) = -p(x)eA,B(dx). 

To the best of our knowledge, Proposition 11.41 for the first time characterizes the equilibrium 
measure from a dynamic perspective. 
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We also identify the limit of the empirical reactive entrance distribution on dB, defined 

as 

1 N ^ 

(1-24) 4,at = ]vE^ + (*)■ 

fc=o B ' k 

To describe its limit as N — > oo, let us denote by L the adjoint of L in L 2 (W d , p(x)dx), given by 

2 o 
(1.25) Lu = -b ■ Vu + - div(ap) • Vu + ti(aV z u). 

P 

This corresponds to the generator of the time-reversed process t i— > Xx-t [HP86j . Note that 
L = L if the SDE (jl.ip is reversible, z.e. L is self-adjoint in L 2 (R rf , p(x) dx). In addition to 
the forward committor function q(x) (recall (|1.5p ). we also define the backward committor 
function q(x) to be the unique solution of 

Lq = 0, x£0 

with boundary condition 

'l, x£dA 

o, x g as. 

In terms of q, we define the reactive entrance distribution on dB as 



(1.26) r)~t(dx) = p(x)n(x) ■ a(x)V q(x) da b{%) 

v 

and analogously the reactive entrance distribution on dA 

(1.27) r/J(dx) = —p(x)n(x) ■ a(x)Vq(x) d<r^(x). 

Again, v is a normalizing constant so that these are probability measures; v is the same as the 
constant in (jl.20p . The following proposition justifies the definition of the reactive entrance 
distribution. 

Proposition 1.5. Let p □ N be the empirical reactive entrance distribution on dB defined by 
(jl.24p . Then /it N converges weakly to rjg as N — > oo. T/tai is, for any continuous and bounded 
f : dB -»• R 



lim / f(x)dp% N {x) = / /(x)dry+(x) 

! ^°°JdB JdB 



iV->oo _ 

ZioWs F-almost surely. 

A similar statement holds for the reactive entrance distribution on <9A and the empirical 
distribution of the points X + . 

T A,fe 

Remark 1.6. // £/te S'-D.E (jl.ip zs reversible, we have q = 1 — q, and hence 77^ (dx) = 77^ (dx) 
and 77^ (dx) = 77^ (dx). 

In view of Proposition 11.41 77^ is a natural choice for the distribution of Yq. With this 
choice, the transition path process Yt characterizes the empirical distribution of A — > B reactive 
trajectories, as the next theorem shows: 
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Theorem 1.7. Let X t satisfy the SDE (jl.ip . Let Y k denote the k th A — >■ B reactive trajectory 
defined by (|1 .3|) . Let Y be the unique process defined by Theorem with initial distribution 
Yq ~ n^(dx) on dA defined by (|1.20|) . and let Q - denote the law of this process on X = 

C([0,oo)). Then for any F G L 1 (X,B,Q-), the limit 

'a 

1 

lim - £ = E[F(y)] 

N^oo TV * — ' 
fc=0 

ZioWs F-almost surely. 

In particular, the limit E[i ? (y)] is independent of Xo Using Theorem II .71 several interesting 
statistics of the transition paths can be expressed in terms of the quantities we have defined. 
Actually, Proposition ll.4l is an immediate corollary of Theorem II .71 by choosing F(Y k ) = f(Y^), 
so we will not give a separate proof of Proposition 11,41 



1.3. Reaction rate. Let Nt be the number of A — >• B reactive trajectories up to time T: 



N T = 1 + max{/c > | rt k < T}. 

k ' 



The reaction rate ur is defined by the limit 

Nt , k 
(1.28) v R = lim — =- = lim 



T->oo T k— too Tg j. 

and it is the rate of the transition from A to B. Also, the limits 

1 N ~ x 

A:=0 

and 

1 iV_1 

fc=0 

are the expected reaction times from A — > B and B — >■ A, respectively. The reaction rate 
from A — > B and B — >• A are then given by kAB = ^Xb an( ^ = ^BA' Another interesting 
quantity is the expected crossover time from A —> B 



1 N ^ 

k=0 

which is the typical duration of the A — > B reactive intervals. Observe that Cab < Tab- 
Similarly, we define 

( L32 ) ^:=Jim -^(r+ fe+1 -r- fe ). 

fc=0 

The next result identifies these limits in terms of the committor functions and the reactive exit 
and entrance distributions. 
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Proposition 1.8. The limits (OBI . (fL29l) . (OUJ) . (Oil) , and (fL32l) hold ¥- almost surely, and 
vr = v= I p(x)Vq{x) ■ a(x)Vq(x) dx. 



Tab = [ r]\(dx)u B {x) = — [ p(x)q(x)dx. 
JdA V R JR d 

T B a= [ r}%{dx)u A (x) = — f p(x)(l - q(x))dx. 
JdB V R JR d 

Cab = [ r]A(dx)v B (x) = — [ p{x)q{x)q(x) dx. 
JdA V R JR d 

Cba= [ rj~(dx)v A (x) = — [ p(x)(l - q(x))(l -q(x))dx. 



IdB V R 

Here ub(x) = E[tJ" | Xq = x] is the mean first hitting time of X t to B, and v R (x) = E[r^ | Yq = 
x] is the mean first hitting time ofY t to B. Similarly, if q is replaced by (1 — q) in the definition 
ofY, then va{x) = E[rJ | Yq = x]. Recall that v is the normalizing factor for the reactive exit 
and entrance distributions. 

The formula for i/ R , Tab, an d Tba were obtained in [EVE06 . We also note that the crossover 
time for the transition path process in one dimension was recently studied by |CGLM12j . 

1.4. Density of transition paths. We now consider the distribution p R as defined in [EVE06 : 
(1.33) p R {z) = lim i / 5(z-X t )I R {t)dt, z£6, 

where R is the random set of times at which Xt is reactive: 

oo 

R=[jKk^B,k}- 
k=0 

This distribution on O can be viewed as the density of transition paths. By Proposition 11.81 
and Theorem 11.71 we can describe pa in terms of the transition density for Yt . Specifically, for 
any continuous and bounded function / : M. d — > M, we have 

/ f(z)p R (z)dz = u R lim -L [ T f(X t )I R (t)dt 

^ N-l ,. T + _ r - 

1 x — T / B,k ' A,k , , , 

= v * J im / f( Y t ) dt 



vrE 



k=0 

ft B 



f B f{Y t )dt \Y Q ~ni 

Jo 



vr / QR(t,7] A ,z)f(z)dz dt. 
Jo Je 



Here QR(t,r] A , z) is the density of Y t , with Yq ~ r] A , and killed at dB 

(1.34) Q R (t, Va~,z) = Q(Y t edz,t<t B \Y ~ n A ), 
and tg is the first hitting time of Yt to B. Hence, for z € @, 

(1.35) Pr{z) = vrI Q R (t,r] A ,z)dt. 

Jo 
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Proposition 1.9. For all z € G, 

(1.36) p R {z) = p(z)q{z)q{z). 

This formula for pr was first derived in |Hum04yEVE 06j . 

1.5. Current of transition paths. The density Qn(t,r]^, z) satisfies the adjoint equation 

d 

■^Q R (t,7 lA ,z) = {L q yQ R {t,ri A ,z), ze@ 
where (L q )* is the adjoint of L q : 

(Li)*u = ^Tia^zMz))^ - Y,{Ki{z)u{z)) Zi 

and K is defined by (jl.lOp . Integrating from t = to t = oo we see that Pr(z) satisfies 

(Liy PR (z) = o, zee. 

In divergence form, this equation is 

(1.37) V z ■ J R (z) = 0, 
where the vector field 

J R {z) = PR (z) (b(z) - + d\v(a(z)p R (z)) 

(1.38) V ?(*) / 



b(z)p(z) - div (a(z)p(z)) J q(z)q(z) + p(z)a(z)[q(z)X7q(z) - q(z)Vq(z) 

is continuous over 0. The vector field Jr(z), identified in [EVE06], may be regarded as the 
current of transition paths (see Remark I1.13|) . Observe that if the SDE (jl,l|) is reversible, 
we have q = 1 — q and 

b(z)p(z) - dw(a(z)p(z)) = 0, 
and hence the current given by (|1.38p simplifies to 

Jr(z) = p{z)a(z)Vq(z). 

This was observed already in [EVE06] . 

On the boundary, the current (|1.38p is related to the reactive exit and entrance distributions. 

Proposition 1.10. We have 

Jr = paVq on dA, and Jr = —paVq, on dB, 

and hence, 

r] A (dx) = — v^}n(x) ■ Jr(x) daA(x) and r/^(dx) = v^}n{x) ■ Jr(x) daB(x). 

As an immediate corollary, we have an additional formula for the reaction rate. 

Corollary 1.11. Let S be a set with smooth boundary that contains A and separates A and B, 
we have 



(1.39) vr = I n(x) ■ jR(x)das(x), 

JdS 

where n is the unit normal vector exterior to S. 
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The current Jr generates a (deterministic) flow in O stopped at dB: 

dZ z 

(1.40) —± = j R (z*), forO<t<t B , Zl = z 

where t B = £b( z ) is the time at which Z t reaches dB. As Jr is divergence free in G, Jr • n < 
on dA, and Jr ■ n > on dB, t B (z) is finite for any z£0. The flow naturally defines a map 
&j R '■ dA —7- dB: given any point z £ dA, we define 

(1.41) $ jR (z) = Z t * B edB. 
Proposition 1.12. For any f G C 1 ^), 

(1.42) / f(x) V +(dx)- [ f(x)n A (dx) = — [ JR-Vfdx. 
JdB JdA V R Je 

In particular, 

$JrAva) = vi, 

where <&j R ,*(r] A ) is the pushforward of the measure rf^ by the map 

Hence, Jr characterizes "the flow of reactive trajectories" from A to B. 

Remark 1.13. Note that by Proposition \l-4\ and Proposition \1.5l the left hand side of (jl.42|) 
is equal, P- almost surely, to the limit 

N-l 

l im ±X^(f(X + )- f(X - )). 

n=0 

If Xt was dijferentiable, we would have 

N-l 



lim AyVf(X+ )-f(X- )) = lim — - [ l R (t) — f(X t )dt 



[ dxVf(x)- lim I [ T X t 5(x-X t )l R (t)dt », 
Je T ^°° T Jo 



N-+oc N 

n=0 

vr Je 

Combining this with Proposition \1.1SX we arrive at a formal characterization of Jr 

Jr"= lim i [ T X t 5(x-X t )l R (t)dt ". 
This formal expression was used in [EVE06] to define Jr. 

1.6. Related work. As we have mentioned, our work is closely related to the transition path 
theory developed by E and Vanden-Eijnden [EVE06, MSVE06, EVE10], which is a framework 
for studying the transition paths. In particular, based on the committor function, formula for 
reaction rate, density and current of transition paths were obtained in [EVE06]. Our main 
motivation is to understand the probability law of the transition paths. The main results 
Theorem 11.11 Theorem 11.21 and Theorem 11.71 identify an SDE which characterizes the law of 
the transition paths in C([0, oo)). Therefore, as an application of these results, we are able to 
give rigorous proofs for the formula for reaction rate, density and current of transition paths in 
[EVE06j. We note that in the discrete case, a generator analogous to (jl.8j) was also proposed 
very recently in jVE13j for Markov jumping processes. 

The transition paths start at dA and terminate at dB, and hence they can be viewed as 
paths of a bridge process between A and B. In this perspective, our work is related to the 



12 



JIANFENG LU AND JAMES NOLEN 



conditional path sampling for SDEs studied in [SVW04llRVE05llHSVW05llHSV07j . In those 
works, stochastic partial differential equations were proposed to sample SDE paths with fixed 
end points. However, the paths considered were different from the transition paths as their time 
duration is fixed a priori. It would be interesting to explore SPDE-based sampling strategies for 
the transition path process identified in Theorem ll.il 

Let us also point out that in the work we present here we do not assume that the noise a is 
small, as is the case in the asymptotic results of [BEGK04, BGK05, CGLM12], which we have 
mentioned already, and also in some other works, such as the large deviation theory of Freidlin 
and Wentzell [FW84| . 

The rest of the paper is organized as follows. Theorem 11.11 and Theorem 11.21 are proved 
in Section In Section we prove Lemma 11.31 Proposition 11.51 and Theorem 11.71 related to 
the reactive entrance and exit distributions. As we have mentioned, Proposition 11.41 follows 
immediately from Theorem 11.71 so we do not give a separate proof of it. Proposition II. 8( 
Proposition 11.91 Proposition 11.101 Corollary 11.111 and Proposition 11.121 are proved in Section SJ 

2. The Transition Path Process 

Proof of Theorem Without loss of generality, we prove the theorem in the case that i = yo 
is a single point in 0. The interesting aspect of the theorem is that yo is allowed to be on <90, 
since the drift term is singular at <90. If we assume that yo 6 G, then existence of a unique 
strong solution up to the time ta A tb follows from standard arguments, since K(y) is Lipschitz 
continuous in the interior of 0. That is, if yo £ @ 5 there is a unique, continuous J^-adapted 
process Yt which satisfies 

ftA(T A /\T B ) ftA(T A AT B ) ^ 

(2.1) Y t = y + / K(Y s )ds + / V2a(Y s ) dW s , t>0. 

Jo Jo 

Moreover, if yo £ ®j then we must have ta > tb > almost surely. This follows from an 
argument similar to the proof of [KS91, Proposition 3.3.22, p. 161]. Specifically, we consider the 
process z t = l/q(Y t ) 6 M, which satisfies 

rthr _ 

ztAr = z - V2(z s ) 2 Vq ■ a dW s 

Jo 

where r = tb A r e with r e = inf{f > | q(Y t ) = e}. Since r < oo with probability one, we have 

z = E[z tAT ] = ^Q(r e < t b ) + Q(r e > r B ). 

Hence Q(r e < t b ) < q(e)(z - 1). So, Q{t a < t b ) < lim e ^oQ(r e < tb) = 0. 

Now suppose yo £ dA. In consideration of the comments above, it suffices to prove the desired 
result with tb replaced by r r , the first hitting time to dB r (yo) n 0, where -B r (yo) is a ball of 
radius r > centered at yo- Thus, we want to prove existence and pathwise uniqueness of a 
continuous J^-adapted process Y t : [0, oo) — > satisfying 

rtAr r l-tAr r _ 

(2.2) Y t = y + K(Y s )ds+ V2a(Y s )dW s , 

Jo Jo 

where 

T r = inf{t >0\Y t e dB r (y ) n } . 
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It will be very useful to define a new coordinate system in the set -B^(yo) = B r (yo) n G and 
to consider the problem in these new coordinates. For r > small enough we can define a 
C 3 map (h^(y), . . . ,h^ d ^(y),q(y)) ■ Bt{yo) — > M. d 1 x [0, oo), such that the scalar functions 
h®(y) :B+(y )^R satisfy 



(2.3) (Vh^(y),a(y)Vq(y)) =0, V y € B+(y ), i = l,...,d-l. 

Furthermore, the map may be constructed so that it is invertible on its range and that the 
inverse is C 3 . The existence of such a map follows from the regularity of dA, the regularity of 
q, and the fact that (n,aVq) ^ on dA by (|1.7j) . 

For two initial points x\,X2 £ 0, let Y" 4 Xl and Y t X2 denote the unique solutions to (I2.ip with 
Yq 1 = x\ and Yq 2 = X2 respectively. That is, 

r tAr B r tAT B 

(2.4) Yf = x+ K{Y x )ds + V2a{Y s x )dW s , t > 0, 

Jo Jo 

where is the first hitting time of Y t x to dB. Changing to the coordinate system defined by 
(h^(y),...,h^ d - 1 \y),q(y)), we denote 

(h 1>t ,q ltt ) = (h(Y^),q(Y^)) and (h 2 , t , q 2>t ) = {h{Y t x % q(Y t x >)). 

Let t~ and r 2 denote the first hitting times of Y t Xl and Y X2 to the set dB r (yo)n@. The processes 
(hi t t,qi,t) and (h2,t,q2,t) are well-defined up to the times t} and t 2 , respectively. 

We can control the difference between (ft,*) and (h,2,t,q2,t)' 

Lemma 2.1. There is a constant C such that for all xi,x 2 £ B r /2{yo) H © 



E 

and 

E 



max (?i )tA T - q2,thr) 2 
te[o,T] 



max |/il jtA T - ^2.Mr| 2 
te[0,T] 



< - X 2 



1/2 



< C|xi - x 2 \ 



where r = t}. A x f 2 . 

The proof of Lemma 12. II will be postponed. One immediate corollary is the following. 
Corollary 2.2. There is a constant C such that for all x%,x 2 £ B r / 2 (jjo) D 

(2.5) Q( max lY*' 1 - Y* 2 ! > a ] < CcT 2 bi - x 2 \ 1/2 , V a > 0, 

\0<t<(TAT) J 

where r = r. 1 A t 2 . 



Proof. On the closed set {z £ M | z = (h(y),q(y)), y £ Br{yo)}, the map y h-> (h(y),q(y)) 
is invertible with a continuously differentiable inverse. Hence there is a constant C, depending 
only on the map y h-» (h(y),q(y)) such that 

|Yr- y n <C(IH*- Vl + K*-92,t|), Vi£[0,r]. 
By combining this bound with Chebychev's inequality and Lemma 12. II we obtain (I2.5p . □ 
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Now suppose yo E dA. Let {x n }^ =1 C G be a given sequence such that x n — > yo as n — > oo. 
For each n, define Yf n by (f!T4"|) . and let r™ denote the first hitting time of Y Xn to dB r (y ) n 9. 
We may choose the points x n so that |x n — yo\ < 25~ n . Define r n = r" +1 A r™. Applying 
Corollary 12.21 we conclude 



max \Y; " +1 - Y f Xn \ > 2~ n < C2 zn 5- n . 

0<t<(TA? n ) ) 

Therefore, by the Borel-Cantelli lemma, the series 

oo 

(2.6) V max \Y? n+1 - Yf n \ < oo 

^0<t<(TAf") 
n=l 

with probability one. Let us define 

(2.7) r r = lim inf T r n = lim inf r n . 

n— >oo n— >oo 

We will prove that r r is positive: 

Lemma 2.3. For all r > sufficiently small, Q(r r > 0) = 1. 

In view of (|2.6p and Lemma [2.3|, we conclude that there must be a continuous process Yt such 
that, with probability one, 

Yf n -»■ F t 

uniformly on compact subsets of [0,r r ), as n — > oo. Let us define 

(2.8) r r/2 = inf{t > | Y t 6 dB r/2 (y ) n 6}. 

Lemma 2.4. For all r > sufficiently small, Q(f r / 2 G (0, r r )) = 1, and r r / 2 is stopping time 
with respect to Tt- 

We will postpone the proof of Lemma 12.31 and Lemma 12.41 Since r r / 2 < T r , Y Xn — > Yt 
uniformly on [0, f r / 2 }- Let us now replace Y t by the stopped process Y tA r r / 2 - Since each Yf n is 
J-f-adapted, so is the limit Y t . We claim that Y t satisfies 

r tA T r / 2 rtAf r /2 

(2.9) Y t = y + / K(Y s )ds+ / V2a(Y s )dW s , t > 0. 

Since Y^" -> F t uniformly on [0,r r / 2 ], we have (q(Y x "),h(Y Xn )) -> (g(Y*), fr(*t)) uniformly on 
[0,f r / 2 ], and (gt,/it) = (?(Yt), ^(Yt)) satisfies 

rtAfr/2 ftAf r /2 . 

(2.10) h t = h Q + f(q a ,h a )ds+ m(q s ,h s )dW s , 

Jo Jo 

and 

(2.11) % - / g(q„ h s ) ■ dW s = lim / mQs )l ds. 

Jo ""Wo 9s 

for all t £ [0,f r/2 ], where (q x ",h Xn ) = (q{Y Xn ),h{Y Xn )). (Recall q = 0.) Since of" > 0, the last 
limit can be bounded below using Fatou's lemma: 

(2.12) qt - r" 9{q „h,).4w,> r^ im «M?£Md ! = f'" r "' \^.M\\ s 

Jo Jo n ^°° Qs n Jo Qs 



REACTIVE TRAJECTORIES AND THE TRANSITION PATH PROCESS 15 

Recall that \g(q s , ti2)\ 2 > C r > 0. In particular, with probability one, the random set H = {s S 
[0, r r / 2 ] \ q s = 0} must have zero Lebesgue measure; if that were not the case, then we would 
have 

ftAr r/ / 2 



g(q s ,h s ) ■ dW s = +oo, 

for all t in a set of positive Lebesgue measure, an event which happens with zero probability. 
Therefore, by Fubini's theorem, 

= 1 / I H (s)ds= [ Q(s < f r/2 , q s = 0) ds 
Jo Jo 

which implies that Q(s < f r /2, q s = 0) = for almost every s > 0. Since f r / 2 > almost surely, 
this implies that we may choose a deterministic sequence of times t n £ (0, 1/n] such that, almost 
surely, qt n > for n sufficiently large. By then applying the same argument as when yo £ 0, we 
conclude that qt > for all t > t n . Hence, qt > for all t > must hold with probability one. 
Since qt is continuous, we now know that for any e > 0, 



m.mq t > 0. 

t>e 



holds with probability one. In particular, 



lim inf min qf " > 0, 

n— >oo t>e 

so that 

lim rw^;-)P Js= p 

n^ooJ e J t Qs 

almost surely. Since qt is continuous at t = 0, we also know that 

lim lim / l9{Qs n ds = lim (q £ - g(q a , h s ) ■ dW t 

almost surely. Returning to (|2,lip we now conclude that 

rtAT r / 2 f tATnA€ \g(q Xn ,h Xn )\ 2 

Qt- g(q s , h s ) ■ dW s = lim lim / s ' s ds 

+limlim r~ \9(?,"f,'T is 

(2.13) -""-J- 

e->0 



\g(Qs,h s )\ 2 dc 



to Qs 

holds with probability one. Equation (|2.9p for Yt now follows from (|2.10p and (|2.13p by changing 
coordinates. 

Except for the proofs of Lemma \2.1\ Lemma 12. 3\ and Lemma 12.41 we have now established 
existence of a strong solution Yt to (|2.2p (with r replaced by r/2). The uniqueness of the 
solution follows by the same arguments. Suppose that Y^ and Y 2 both solve (|2.2p with the 
same Brownian motion and the same initial point Yq = Yq = yo- Then Corollary 12.21 implies 
that, Q almost surely, Y^ = Y t 2 for all t £ [0, A r 2 \ where t} and t 2 are the corresponding 
hitting times to dB r (yo) n @. In particular, = t 2 . This proves pathwise uniqueness. □ 

We now prove Lemma 12. 11 Lemma l2.3l and Lemma [2.41 to complete the proof of Theorem ll.il 
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Proof of Lemma \2. 1[ By Ito's formula the process (hi,qi) = (hij, qi,t) satisfies 

(2.14) dh x = f( qi , h 1 )dt + m( gi , h r ) dW t , 

(2.15) d Ql = |g(9lA)|2 dt + g{q Xt h x ) • dW t , 

oi 

for < t < r/, where the functions g = V2(Vq) T a G R. d , f = Lh G R d_1 , and m = \/2(V/i) t ct G 
]R( d_1 ) xd , are all Lipschitz continuous in their arguments over Bt ■ Similarly, (h 2 , q 2 ) = {h 2 ,t, <?2,t) 
satisfies 

(2.16) d/i 2 = /(<? 2 , /i 2 ) dt + m(q 2 , h 2 ) dW t 

(2.17) dg 2 = |g(g2l/t2)|2 dt + 5 (g 2) ha) • dW t , 

02 

for < t < t^. Notice that the choice of coordinates satisfying (|2.3p has eliminated a potentially 
singular drift term in the equations for hij an d hs,t- On the other hand, the drift term in the 
equations for q\ and q 2 blows up near the boundary q = 0. Indeed, if r > is small enough, by 
(jl.7p there is a constant C r > such that 

(2.18) inL2(Vq(y)),a(y)Vq(y)} > 2A m±_\Vq(y)\ > C r . 
y eB+ y eB+ 

Hence, 

(2 19) Wl,tM,t)? = 2{Vq(Y t ^),a(Y^)Vq(Y^)) > 2A taf_|Vg(y)| > C r > 0. 

v ' ; yeB+ 

Letting r = t} A and using (|2.14p and (|2.16p , we compute 

d \h x - h 2 \ 2 = 2(fci - /i 2 ) T (/(gi, h!) - figs, h 2 )) dt 

+ 2{h x - h 2 ) T (m( gi , hi) - m(q 2 , h 2 )) dW t 

+ tr((m(gi, h\) - m(q 2 , h 2 ))(m(q 1 ,h 1 ) - m(q 2l h 2 )) T ) dt 

for < t < t. In particular, 

E [\h 1MT - h 2:tAr \ 2 ] < C [ E [l [0 , r] (s)(<?i )S - g 2 ,s) 2 ] ds 



(2.20) 



+ C f 1 [I[ ,r](s)|/ii, s - /i 2 , s | 2 ] ds + C\ Xl -x 2 \, 
Jo 

< C I E [(gi )S AT - 92,sAr) 2 ] ds 

JO 

+ C / E[|/i 
Jo 



^2,sat| I ds + C\x\ - x 2 \, 



1,sAt ~~ 'i2,sAr 

^0 

holds for all t > 0. 

From (j2. 15|) and (|2.17p we also compute 

d (qi - q2) 2 = 2(<?i - g 2 )d(gi - q 2 ) + \g x - g 2 \ 2 dt 

(2.21) =2( qi -q 2 )(^-^)dt 

V 11 02 J 

+ 2(?i - g 2 )(si - 52) • dWt + | 5 i - 52 | 2 dt 
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for < t < t, where we have used the notation g\ = g(qi,h\) and g 2 = g(q2, h 2 ). We claim that 
there is a constant C, depending only on r, such that 

(2.22) 2(qx - q 2 ) ( ^ - ^) < C(\ Ql - q 2 \ 2 + \h x - h 2 \ 2 ) 

holds for all t < r, with probability one. Both sides of (|2.22|) are invariant when (qi,h\) and 
(q 2 ,h 2 ) are interchanged. So, we may assume q\ < q 2 without loss of generality. We consider 
the following two possibilities. First, suppose that 

(2-23) < qi \\ gi \ 2 - \g 2 \ 2 \ < (q 2 - qi )\ gi \ 2 . 

Using this and q\ < q 2 we have 

V <?i Q2 J 

( 2 - 24 ) = 2^^1 ((q 2 - qi )\ 9l \ 2 - qi (\g 2 \ 2 - \ gi \ 2 )) 

q\q2 

(f2T23]l 

< 0. 

The other possibility is 

(2-25) < (q 2 ~ qi)\9i\ 2 < qi\\gi\ 2 ~ \g 2 \ 2 \- 

In this case, we have (also using q\ < q 2 ) 

2( gi ~ q 2 ) f M - = ((q 2 - qi )\ 9l \ 2 - q l{ \ 92 \ 2 - \ 9l \ 2 )) 

V qi Q2 ) q\q2 

<_ 2 (^lM gi( | 52 |2_ k|2) 
<?1<?2 

(2.26) < 2 fc-^|| ff2 | 2 -| 5l | 2 | 

< 2 ^-^|M 2 -bil 2 | 

^o (|g2| 2 -|gl| 2 ) 2 

Therefore, since |<?i| > C r > (by 12.191 ). we must have 

2(gi - a) f ^ - ^) < 2C- 2 (| 52 | 2 - | 5l | 2 ) 2 < - g 2 | 2 + - h 2 \ 2 ). 
\ qi q2 J 



where C > depends only on r. This establishes (|2.22p . 
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Returning to (|2.2ip and controlling the first term on the right hand side of (|2.2ip with (|2.22p , 
we conclude that 



(2.27) 



E [(gi ltA T " q2,tArf] < C E [l [0)r] (s)(g 1)S - q 2:S f] ds 

Jo 

+ C [ E [l [0>T] (s)\h hs - /i 2 , s | 2 ] ds + C\xi - x 2 \ 
Jo 

< C I E [(qi,sA T ~ ^2,sAr) 2 ] ds 
Jo 

+ C / E [\hx sAt - /i 2 , s at| 2 ] ds + C|xi - x 2 |. 



By combining (|2.20p and (|2.27|) and applying GronwalPs inequality, we conclude that 
(2.28) E [\h 1>tAT ~ h 2M r\ 2 ] + E [(gi, tAT - g 2 ,tAr) 2 ] < C\x x - x 2 \ (l + ie ct ) , t > 0. 
Using (f!T2"T|) and (f2~22]) we also obtain 



E 



(2.29) 



max (<?l, tA T - ?2,tAr) < C / E[(5l,sAr ~ ^2,sAr) ] ds 

ue[o,T] J Jo 

+ C / E[|/ii )SAr - /i 2 , sA r| 2 ] ds + C\x\ - x 2 \ 



+ E 



max Vt 

te[o,T] 



where Vt is the martingale 



ptAT 

V t = / 2( gi - q 2 )( 9l - g 2 ) ■ dW s . 
Jo 



By the Burkholder-Davis-Gundy inequality (e.g. RYi)9l Sec IV.4]) and (|2.28j) . we have 



E 



max 

te[o,T] 



T x 1/2 

<C( / E[(g MAr - g 2 , sAT ) 2 ] ds < C T \x x - x 2 \ 1 / 2 . 



This, together with (T2T28D and (i!T29t gives us 

E 

Similar arguments for h\ — h 2 lead to 



max ( qi thT - q 2M r) 2 
te[o,T] 



< Crbi - x 2 



1/2 



E 



max |/ll tAT - tl 2 t AT \ 
iS[0,T] 



< Ct|xi - x 2 |. 



□ 



Proof of Lemma \2.3l Suppose r r = holds with probability e > 0. Because of (|2.6p we may 
choose m sufficiently large so that 

<x> 

V max \y. Xn+1 -Y, Xn \ < r/4 

^ 0<t<(TA?") 
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holds with probability at least 1 — e/2. Therefore, with probability at least e/2 we have both 
r r = and 

(2.30) liminf \Y* n - YSr | < r/4. 

n— >oo T r 

Recall that [Yq" 1 — yo\ < 25~ m . Let m be larger, if necessary, so that 25~ m < r/4. This and 
(12301) imply that 

liminf \Y?£ - y \ < liminf (\Y?£ - Y%»\ + \Y*T - y \) < r/4 + 25" m < r/2 

holds with probability at least e/2. However, this contradicts the fact that Y x " G dB r {yo) for 
all n. Hence, we must have r r > with probability one. □ 



Proof of Lemma \2.4\ The fact that T, r / 2 > with probability one follows from an argument very 
similar to the proof of Lemma 12.31 The fact that f r /2 < T r will follow by showing that 

(2.31) limsup \Y t — yo\ > r 

t/r r 

holds with probability one. First, suppose that r" < r r and that 

r™ = inf r r fc 

k>n 

Then by (|2.6|) we have 

\Y T? - y | > \Y% - yo\ - \Y T? - Y&\ = r - \Y T n -Y%\ = r- R(n). 
where R(n) is the series remainder 

oo 

R(n) = V max \Y, Xk+1 - Y Xk \ 



k=n 

which converges to zero, with probability one, as n — > oo. So, with probability one, if there is 
an increasing sequence of such times Tr j /* T r as j — > oo, we see that (|2.3ip must hold. On the 
other hand, suppose there is no such sequence. Then we must have r" > r r for n sufficiently 
large. Hence Y t Xn must converge to Y t uniformly on the closed interval [0,r r ]. Suppose r™ > r r 
and r™ = sup fc>n r*. Then for all k > n, we have 

\y% - y | > \r r y - yo \ - \y% - Y r y\ 

= r - |Y*» - Y x k k \ > r - M(n). 
Therefore, since Y t Xn is continuous on [0, r™] and since r r = liminf fc>o Ty, we have 

\Y x r n - yo| > t - M(n). 

Since Yj? n — > l^- r in this case and Yi is continuous on [0, r r ], then with probability one, this case 
also implies that (|2.31|) holds. Having established that < r r / 2 < r r we conclude that Y t Xn — > Y t 
uniformly on [0,r r / 2 ]- Since each Yf n is J^-adapted, so is the limit Yt. In particular, ? r / 2 is a 
stopping time. □ 
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Remark 2.5. Let us point out that if yo G dA and T > is sufficiently small, the equation 

(2.32) Y(t) = y + f K(Y(s))ds, t e [0,T]. 

Jo 

has a unique solution satisfying Y(t) G for all t G (0,T]. Indeed, let z{t) solve the ODE 

z'(t) = 2a(z(t))Vq(z(t)) + q(z(t))b(z(t)) 

for t G [0, T], with z(0) = yo- F° r sufficiently small T, z(s) G for t G (0, T}. Hence q(z(s)) > 
for t G (0, T] and the function F(t) = q(z(s)) ds is invertible. Now, it is easy to check that 
the function Y(t) = z(F~ 1 (t)) is continuous on [0, T] and satisfies (|2.32p . Moreover, Y(t) G O 
for all t G (0,T]. In fact, 

(Vg(y ),a(yo)Vg(yo)) i/2 

/or small t. 

We state and prove two properties of the transition path process, which will be used later. 
Proposition 2.6. Let F be a bounded and continuous functional on C([0, oo)). Define 

g(x) = E [F(Y) \Y = x] 
where Yt satisfies (ll.llj) , T/ien 5 G C(G). 

Proof. Suppose that {x n }" =1 C and that x n — > x G as n — > 00. We claim that there must 
be a subsequence {x nj )Y=i such that, Q-almost surely, 

(2.33) lim F(Y j ) = F(Y), 

where Y/ satisfies (jl.lip with Yq = x n ., and Y t satisfies (jl.lip with Yo = x. Since F is bounded 
and continuous on C([0, 00)), the dominated convergence theorem then implies that 

lim g(x nj ) = lim E [F(Y) \ Y = x nj ) = E \F(Y) \Y = x} = g{x). 

j— >oo j— >oo 

Since the limit is independent of the subsequence, this implies that g(x) is continuous. 

To establish (I2.33p . we must show that Y/ — > Y t uniformly on compact subsets of [0, 00). This 
follows from Corollary 12.21 as i n the proof of Theorem 11.11 □ 

Proposition 2.7. For any R > 0, there are constants k\,k2 > such that 

Q(Y t G I Y = x) < k ie ~ k2t 
holds for all t > and x G 0, |x| < R. 

Proof. If x G 0, then by the Doob h-transform, we know that 

¥(X S G Vs G [0,t], t b <t a \X = x) 



G I Yq = x) 



< 



F(t b <t a \X = x) 
F(X S G Vs G [0, t] I X = x) A P(r B < t a \ X 

F(t b <t a \X = x) 
F(X S G Vs G [0, t] I Xq = x) A q{x) 
q(x) 
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Since the process X t is ergodic, there must be constants C\, C2 such that 

F(X S eeVsG [0, t]\X = x)= ¥(X S $ A U B Vs G [0, t] \ X = x) < C x &~ Cit 
for all \x\ < R, t > 0. So, for any e > 0, 

A p-C-it A 

(2.34) Q(y t G 9 I Y = x) < -± 

e 

holds for alH > and x G {x G | |x| < i?, g(x) > e}. 

The bound (|2.34p does not include points near dA, where q(x) < e. Fix e G (0, 1) and define 
the set S = {x G | q(x) < e} U A. If e is small enough, this set is bounded and we may assume 
|x| < R for all x G S. Suppose Yq = x with x G S n 0. Let = (7(F), which satisfies 



% = qo + T ds + / y(F s ) dW s 

Jo Qs Jo 



where g(y) = v / 2(Vq , (y)) T cr(y). By (jl.7p we know that if e > is small enough, there is a 
constant C e > such that |y(y)| 2 > C t for all y G S n 0. Therefore, if F 4 G 5 n for all 
t G [0, T], we must have qt < e for all i G [0, T] and 

<7t> [ t — ds+ [ g(Y s )dW s >te~ 1 C e + f g(Y s )dW s 
Jo Qs Jo Jo 

for all i G [0, T]. This happens only if the martingale Mt = J * g(Y s ) dW s satisfies 

M t < e - te~ l C t , te[0,T]. 

To control the probability of this event, for any a > 0, (3 > 0, T > 0, Chebychev's inequality 
implies 



P 2 f 1 i_i2j.M < e -/3"T+4i|| 9 ||^T 



exp(y / Iff I <is 



By choosing /3 = a/Hyll^ we have Q(Mt < — aT) < e a2c 3 T . Hence there is a constant C4 such 
that 

(2.35) Q(Y t eSne, v t g [0, r] 1 y = x) < e- t2CiT 

holds for all T > 1 and x G 5 n 0. 

Now we combine (ETMl) and (l2T35|) . Let r 5 = inf{t > | F ( G 55}. By (l2735|) we have 
Q (t s > t/2 \Y = x)< e~ C5t holds for all x G Sn 0. Therefore, since rs is a stopping time, we 
conclude that 

Q(Y t G I Y G x) < Q(Y t G 0,r 5 < t/2 | Yo G x) + e" C5 * 

< sup Q (y t/2 G I Y G y) + 
yeds 

e 

for all x G 5 D 0. □ 

Proof of Theorem \1.2l Since r^" n is a stopping time, it suffices to prove the result for n = 0. Fix 
e > and let S D A be the open set 



S = {x G I q(x) < e} UA. 
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For e > small, this is a bounded set that separates A and B. The boundary dS is an isosurface 
for q: q(x) = e for x G dS. As e — >• 0, 5 shrinks to A, and the Hausdorff distance d^(dS,dA) is 
0(e) (because of ([T77]) ). 

Recalling that t^ = inf{t > | Xt G j4}, we define 

rs.o = inf{t > r+ | X t G 95}. 

which is a stopping time with respect to Ft- Then for k > 0, we define inductively the stopping 
times (see Figure [2]) 

r A>k = inf{i > r s ,k \ X t G A}, 

r B ,k = inf{£ > r s ,k \ X t E B}, 

rs,k+i = inf{i > r A ,k \ X t G dS}. 

Observe that rs t k < TA,k < rs,k+i, although it is possible that rs,k = rB,fc+i- Let rAB,k = 
r A,k A ^_e,fc, which is finite with probability one. We also define the random time 

r S j = mf{t > r A . | X t G dS}. 




Figure 2. (Left panel) The set S and random times tsj- (Right panel) Zoom-in 
of the boxed region together with stopping times r$ : k and r A ^. 

Although tsj is not a stopping time with respect to Ft, the relation 
(2.36) {r s ,k\k>0, r Bsk < r A ,k} = {r S ,j}T=o 

holds P- almost surely. 
Now, let 

Y t ° = X (t+T - o)AT+Bo , t>0, 

and let ho = ts$ — t ao- Since F is bounded and continuous, and since ho — > (F almost surely) 
as e — > 0, we have 



(2.37) nnX. +T - )) = E[F(Y )] = limE[F(Y° +ho )}. 

We will show that 

M[F(Y° +ho )}=E[g(X T - o )] 

where g(x) = E[F(Y) \Y =x]. 
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Let M be the unique (random) integer such that 

ts,o = rs,M- 

Equivalently, M = mm{k > | rg & < r^k}- Since Tbu > r A,k f° r all k < M, we have 

M oo 

(2.38) F(Y° +ho ) = ^F(X. +rs k )l rB k<rA k = ^2 F(X. +rs k )I T . B k<rA k I k < M . 

k=0 k=0 

Observe that the event {k < M} coincides with the event that tb j > r^j for all j < k, so the 
event {k < M} is measurable with respect to F rsk . Therefore, we have 



H F ( Y °+h () )} = E E [ F ( X -+r S ,k) I r B , 



k<M\ 



k=0 

oo 



E E [H F ( X - +r S ,k) I r B ,k<rA,k I k<M I ?rs,i 



k=0 

oo 



[h<M nF(X. +rSth )Ir B , h <r Alh I F rs J 

k=0 

oo 

^E[l fc < M /(X r 



fc=0 



where 



fix) = E[F(X.)I TB<TA \X = x] = q(x)E[F(Y) \ Y = x]. 

The last equality follows from the Doob ^-transform (since x 6 dS C O here). Since q(x) = e 
for all x € dS, this means 

M 



(2.39) 



E[F(Y° +hn )]=eE 



E 9(X.. 



rs,k, 



.k=0 



where g(x) = E[F(Y.) \Yq = x]. Note that the random integer M depends on e. 

Let Aj denote the event {j < M}, which occurs if and only if r^k < rs,k for all k E 
{0, 1, . . . , j}. Since q{x) = e for all x G dS, the event Aj is independent of X rs £ Moreover, 
P(Aj) = (1 - e)*' +1 , since 



P(Aj) = E 



E 



.fe=0 

'i-i 

. fc=0 



(l-e)P(^-_i). 



Similarly, P(M = j) = e(l - e) J '. Now we evaluate (f!T39l : 



E[F(X° +?l0 )] = eE[ 5 (X rSj0 )] + eE 
= eE[ 5 (X rSi0 )] + eE 



E 



k=l 

oo 



E^5(^ +1 )] 

i=o 
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eE[ 5 (X rSi0 )]+e^E[l A . 5 (X rs> . +i ; 



3=0 



= eE[g(X rso )\ +eY / F(A j )E[ g(X rSj+1 ) ] 

j=0 
oo 

= eE[g(X r3t0 )) + ej> - e)i +1 E [ g(X rs . +1 ) ] 
j=0 

oo 

= Y,e(l-e)*E[g(X r8J )] 

j=0 
oo 

= J2HM = j)E [ giXrgJ } = E [ g(X TSfi ) } . 

j=0 

Now let e — > 0. Since g(x) is bounded and is continuous up to dA by Proposition 12.61 we have 
(by the dominated convergence theorem) 



(2.40) limEb(X rSi0 )] = E[lim 5 (X TSi0 )] = E[g(X 



3. Reactive Exit and Entrance Distributions 
Proof of Lemma ll.tA The equality (|1.19|) is equivalent to 

p{x)n(x) ■ a(x)Vq(x) das(x) = 0. 



□ 



'88 

Using ()1.16p . it is then equivalent to 

(p,Lq) = (L*p,q)=0, 

which is obvious. □ 

Before proving Proposition II. 5\ we will need establish some properties of the entrance and 
exit distributions and of the harmonic measure associated with the generator L. These results 
will also be used later in the paper. First, using integration by parts, we have 

Lemma 3.1. Let D C M. d be open with smooth boundary. Let <f),ij) E C 2 (D) n C l {D) and 
bounded. Then 



(3.1) / p(x)((j)(x)Lip(x) — ^(x)L4>(x)^ dx = / p(x)b ■ n(x)4>(x)ijj(x) da£)(x) 

JD JdD 

+ J p(x)<j)(x)n{x) ■ aS/ip(x) — ip(x)n(x) ■ dw(a(x)p(x)4>(x)) dao{x), 

JdD 

where n{x) is the exterior normal vector at x £ dD. 

Let us recall some tools from potential theory (see for example the books [Pin95, Szn98] and 
also [BEGK04, BGK05J where potential theory was applied to analyze diffusion processes with 
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metastability) . The harmonic measure Hu(x,dy) is given by the Poisson kernel corresponding 
to the boundary value problem 



Lu(x) = 0, x G D, 
u(x) = f(x), x G dD. 



(3.2) 

Therefore, for / G C(dD), 

(3.3) u(x) = [ H D (x,dy)f(y), 

JdD 

is the unique solution to ()3.2j) . Similarly, the harmonic measure Hn(x,dy) corresponds to the 
generator L (recall (jl.25p ). For the boundary value problem 



Lu(x) =0, x € D, 
u(x) = f(x), x G dD, 



(3.4) 

the solution is given by 

(3.5) u(x)= [ H D (x,dy)f(y). 

JdD 

The harmonic measures have a probabilistic interpretation: H£>(x,dy) (resp. H£>(x,dy)) gives 
the probability that the process associated with the generator L (resp. L) first strikes the 
boundary dD at dy after starting at x. In particular, 

q(x) = Hn(x,dB) and q(x) = Hc(x,dA). 

We also define the harmonic measures for the conditioned processes as 

(3.6) H q e (x,dy) = ^H e (x,dy). 

For x G © this is a measure on dB. For x G dA where q(x) = 0, we may define H^x^dy) 
through a limit: 

..... —-H (x,dy) = — ^— — - — -— 

x'gB q{x) n[x) ■ a(x)V x q{x) 



(3.7) Hl{x,dy) = ijm ^L He (x,dy) = H ZT'JJZ. ',1 > * e dA. 



Recall that q(y) = 1 for y G dB. 

Recall the reactive exit and entrance measures rj^, T]\, r]g and rjg. They are connected by 
harmonic measures as follows: 

Proposition 3.2. 

(3.8) / r,2(dx)H^(x,dy)=r,+ (dy). 

JdA 

(3.9) / 7 1 +(dx)H sc (x,dy) = V +(dy). 

JdA 

(3.10) / 7 ] +(dx)H AC (x,dy) = rj+(dy). 

JdB 
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Proof. We prove $£E§ first. If / e C{dB), let u f {x) solve Lu = in 6 with 



f{x), xedB, 

0, x £ dA. 



(3.11) it 
Hence u(x)q(x) = on dQ. By applying (|3.ip with <f>(x) = q(x) and ip(x) = Uf(x), we obtain 



p(x)n(x) ■ a(x)Vuf(x) da A {x) = / f(x)n(x) ■ div(a(x)p(x)q(x)) das(x) 
dA JdB 

(3.12) = / f(x)p(x)n(x) ■ a(x)Vq(x)da B (x) 

JdB 



dB 

f( x )v B (dx). 

dB 



From (j3.7|) and (|1.20|) . we see that for all x G dA, 



i] A (dx)H^(x,dy) = - p{x)n(x) ■ a(x)V x H e (x,dy) da A (x). 
dA JdA 

Hence for any / £ C(dB), we have 

( I r)2(dx)H q e (x,dy)) f{y) = - [ [ p(x)n{x) ■ a{x)V x (f(y)H e (x,dy)) da A (x) 

dB \JdA J JdB JdA 



p(x)n(x) ■ a(x)V x / H e (x,dy)f(y) da A {x) 

\JdB J 

p(x)n(x) ■ a(x)V x Uf{x) dx. 

dA 

Combining this with (|3.12p . we conclude that 

[ ([ 7 ] 2(dx)H^x,dy))f(y)=[ f(x) V +(dx), VfeC(dB), 
JdB \JdA J JdB 

which proves ()3.8p . 

To prove (|3.9p . let rp solve Lip = for x £ B c with ip = f on dB. Then by (|3.1|) with 
(p = 1 — q , we have 

n\{dx)Tp{x) = j p(x)n(x) ■ a(x)Vq(x)ip(x) da A (x) 
,L\ JdA 

p{x)n{x) ■ a(x)V(l — q{x))ip{x) da A (x) 

dA 



— / ip(x)n(x) • div(ap(l — q)) da A (x) (since 1 — q = on dA) 
JdA 

/ fn ■ div(ap(l - q))da B {x) - / fpb-nda B (x) 
JdB JdB 



IdB JdB 
— / pn ■ aV ip da b{x) ■ 

JdB 

Applying f)3. 1 1) with the function (p = 1, we also find that 



= — / fn • div(ap) do~B{x) + / / pb ■ n da b{x) + j pn ■ aVip das(x). 
JdB JdB JdB 
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Therefore, since 1 — q = 1 on dB, we conclude that 



ri\(dx)il)(x) = I fn- div(ap(l - q)) da B (x) - / fn(x) ■ div(ap) da B {, 
JdB Job 

= / fpn ■ aV(l - q) da B {x) 
JdB 

= -/ fpn-aVqda B (x) = fij£(dx). 
JdB JdA 



We arrive at (|3.9p noting that 



ip(x) = / H S c{x,dy)f{y). 

IdB 



We omit the proof of (|3.10p which is analogous to that of (|3.9p by switching the role of A 
and B. □ 

By combining (|3.9j) and (|3.10p we immediately obtain the following: 

Corollary 3.3. Let P B (x,dy) be the probability transition kernel 

P B (x,dy)= H A c(x,dz)H §c (z,dy), x,y G dB 
JdA 

on dB, and let Pji(x,dy) be the probability transition kernel 

P A (x,dy)= H B c(x,dz)H AC (z,dy), x,y G OA 
Job 

on OA. Then 

JxedB 

and 

ri\(dx)P A {x,dy) = V A {dy)- 



Ix&dA 

That is, r]g and rj A are invariant under P B and P A , respectively. 

We are ready to return to the proof of Proposition 11.51 

Proof of Proposition \1.5\ We first verify that r]g is a probability measure. Taking tp = q and 
(p = q in (|3.ip . we obtain using the boundary conditions of q and q on dA and dB, 



7] A (dA) = — [ pn-aVqda A = — [ n ■ div(apq) da B 
v Ja A v JdB 

= — I n ■ apVqda B = rf^ipB). 

v JdB 



IdB 

This shows that rjg(dB) = 1 and v is the correct normalization constant. 

Let g be a positive continuous function on dB. Define for x ^ B, 
(3.13) u(x)=E[g(X TB )\X = x]. 

Hence u satisfies the equation 

f Lu{x) =0, x G B°; 



(3.14) 



u(x) = g(x), x G dB. 
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Let Hg c (x,dy) be the harmonic measure (the measure of the first hitting point on B for the 
process starting at x). We have 

(3.15) u(x) = [ H §c (x,dy)g(y). 

Job 

By the maximum principle, u > in B c . By the Harnack inequality and the compactness of 
dA, we have 

(3.16) sup u(x) < C inf u(x) 

where the constant C > only depends on the elliptic constants of a; in particular, C is 
independent of g. Therefore, we obtain for any x, x' £ dA, y 6 dB 

(3.17) < C- 1 < ^ T Bci : X ;^\ <C<oc. 
If we define 

(3.18) u(dy)= inf H^dy), 

xGaA 

then v{dy) > on dB and 

(3.19) H B c(x,dy)>C~ 1 u(dy) 
for any x £ dA. 

Consider the Markov chain given by {X T + on dB. Let Pb denote its transition kernel, 

given by 

(3.20) P B (y,dy')= [ H Aa (y,dx)H Bc (x,dy'). 

JdA 

By (|3.19p . Pb satisfies Doeblin's condition: 

(3.21) P B (v,dy') yC- 1 [ H Ac (y,dx)v(dy) = C- x v(dy). 

JdA 

Therefore, Pb has a unique invariant measure. By Corollary 13. 2>\ this invariant measure is given 
by r/g. The convergence in Proposition 11.51 now follows (see e.g. |MT09| ). □ 



Proof of Theorem \1. 7| Consider the family of processes 



v A,n _ v 
A t - A/ 



Observe that the n reactive trajectory t \— > Y t a is a subset of the path t \— > X t ' n ; specifically, 
= X ,n _ + for all t > 0. The random sequence of points 

t + T A,n~ T A,n 

y n = X^ n = X r+ G dA, 71 = 0,1,2,... 

A,n 

corresponds to a Markov chain on the state space dA with transition kernel 

P A (x,dy) = P(y„+i edy\y n = x)= / H B c(x,dz)H A c(z,dy). 

JdB 
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As shown in the proof of Proposition 11.51 this chain has a unique invariant probability distribu- 
tion r)\ supported on dA: 

/ r]\{dx)P A {x,dy) =r]\(dy). 
JdA 

The sequence of processes t i— > xf' n corresponds to a Markov chain on the metric space 
X = C([0, oo)). It can be shown that this is a Harris chain with unique invariant distribution 

P(U)= [ rjX(dx)P x (U), VU£B 

JdA 

where V x denotes the law on (X, B) of the process 1 1-> Z t /\ TB where 

dZ t = b{Z t ) dt + V2a(Z t ) dW t , Z = x 

and tb is the first hitting time of Z t to B. (The uniqueness of V follows from the uniqueness of 
r]\ as an invariant distribution for the chain defined by transition kernel Pa on dA.) Therefore 
(see e.g. |MT09j ). for any $ G B X {X , B,P) the limit 

1 N 

(3.22) km - $(X A ' k ) = nHZ. ATB ) I Z ~ r,+] 

JY^oo JV ' — ' 

k=l 

holds P- almost surely. 

Using (13.22j) we will establish the following relationship between 77^ and r\\: 

Lemma 3.4. Let X t satisfy the SDE with initial distribution Xq ~ rj\ on dA. Then for 
any Borel set U C dA, 

F(X- G U | X ~ Va) = Vl(U) = ~ f P{x)n{x) ■ a(x)Vq(x) da A (x). 
t a,o A v Ju 

Proof of Lemma \3.4\ Let / G C(M d ) be bounded and non-negative. Then by applying (|3.22[) to 
the functional $(X) = f(X T - ), we obtain 

1 N ~ x 

J™ N E K X -rs,J = limIE[/(X rs ,) | X ~ rj+] 

e— >U JV— »oo JV ' — * e— >0 
n=0 

= E[/(X r - o ) | X ~t/+]. 

We also have, 

jY-1 jr-i 

(3.23) lim £/(X TSin )= lim — km - £ /(^ fc )I r ^ <rA , fc = / /(x)Cs(dx), 

TV— >oo JV ' — ' If— >-oo JVft- if— >oo A ' — ' ' /flc' 

n=0 A fc=0 °* 

kolds P-almost surely, wkere iV# = |{& G {0, 1, . . . , K — 1} | rs,k < r A,fc}|- Here we kave used 
Qs to denote tke unique invariant distribution (identified below) for tke Markov ckain defined 
by X rs k on dS. Tkerefore, 

E [f(X ) | X ~ 77+] = km / /(x)Cs(dx). 

T A,0 A e^O J ds 

We claim tkat if f{x) is uniformly continuous in a neigkborkood of dA, tken 

(3.24) km / ( S (dx)f(x) = [ r)A~(dx)f(x). 

J OS JdA 
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First, let us identify the invariant distribution £5. By applying Corollary 13.31 (replacing B by 
S c ) we can identify as 

Cs(dx) (= Vs( dx )) = -^P{x)n{x) ■ a(x)Vq s (x)da s (x), 
where n(x) is the exterior normal at x G dS, and q$ satisfies Lqs = in S with 

(l, xGM 



Qs{x) 



0, i£ 95. 



Note that f is independent of e. Let 5 > e be small, and suppose that f(x) is continuous on the 
closed set {x G | < q(x) < 5}. (This set contains both dA and dS). A computation similar 
to (|3.12p (replacing B by S) shows that for any such function, we have 

(3.25) / ( s (dx)f(x) = -- / p(x)n(x) ■ a(x)Vuf >s (x)da A (x), 

JdS v JdA 

where Uf t g satisfies Lu = in S \ A, and 

ff(x), xedS 
\o, x G dA. 

Since / > 0, we have u > in S \ A. Now, let us define 

z f,s( x ) = e " / f f ! xG5\A 

which satisfies L 9 z = in 5 \ j4, with z = / on 95" (recall that g(x) = e for all x G 95). By 
the boundary Harnack inequality ( |Bau844lCS05j ). Zf t s(x) is bounded and Holder continuous on 
S \ A (including dA). We claim that for any xq G dA, we have 

(3.26) lim Vu /j5 (x) = e' 1 z f s (x )Vq(x ). 

Since Vu^g, Vg, and 2^5 are continuous up to dA, this is true if and only if 

lim q(x)Vzf s(x) = 0. 

Suppose q(x)Vzj j s(x) ->u^Oasi->ioG dA. Then we must have 

lim Vuf s(x) — Zf s(x)Vq(x) = v 

X— >Xo 

so that v must be a multiple of n(xo) (since u and q vanish on dA). Thus, we would have 

(3.27) n(x ) ■ Vz ft s(x) ~ (n(x ) • v)q(x)~ 1 

as x — > xq G <9A If i> 7^ 0, then (n(xo) • v) 7^ 0, so (|3.27p and the fact that g = on cM would 
contradict the boundedness of Zf t s(x). Therefore, (|3.26p must hold. 
Combining f|3.25|) and (|3.26p we obtain 

Cs(dx)f(x) = / p{x)n(x) ■ a(x)Vq(x)z f ^ s (x)da A (x) = [ rf^{dx)zf S {x). 

dS v JdA JdA 

Therefore, as e — > 0, 

(3.28) lim / ( s (dx)f(x) = lim / r]^(dx)z fS (x) = r/ A (dx)/(x). 
*+°Jd8 ^°JdA JdA 

This establishes (13.24p and completes the proof of Lemma I3.4L □ 
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Now we continue with the proof of Theorem 11.71 We will apply Theorem 11.21 Suppose that 
F £ L 1 (Af,S, Q„-), and define the functional 

$m = f(x, , - u + ). 

Combining Theorem 11.21 and Lemma 13.41 we see that 3> € L 1 (A' ,B,V), since 
P($(X) > a) = > a | X ~ r/+) 

= Q(F(Y) >a|F ~^) = Q(F(y) > a)- 

Therefore, 

1 N-l JV-1 JV-1 

fc=0 fc=0 ' ' fc=0 

By (|3.22p and Theorem 11.21 we now conclude that the limit 
1 N ~ 1 

lim - ^ F(y fc ) = E[$(Z. Ars ) | Z ~ 7?+] = E[F(Y) | Y ~ V A ] 

iV— >oo iv — 
fc=0 

holds P-almost surely. This completes the proof of Theorem 11.71 □ 

4. Reaction rate, density and current of transition paths 
4.1. Reaction rate. 

Proof of Proposition ["Ol Denote tb the first hitting time of X% to B. Consider the mean first 
hitting time 

u B (x) =E[t b \ X = x], 

which satisfies the equation 



(4.1) 

By definition of ry^, we have 



Lub(x) = —1, i£0 
ub(x) = 0, x £ <9.B. 



(4.2) / //~[ (u.r r) — — j p(x)uB(x)n(x) ■ a(x)Y iji.r) drr \[ .>■ ) . 

Observe that 



dA v JdA 



p(x)q(x)dx= / p{x)q(x)dx 



p(x)q(x)(LuB)(x) dx 

= — I p(x)(Lub)(x) dx — / p(x)q(x)(LuB){x) dx. 
J A JO 

Using ()3.ip with D = A, <j)(x) = 1 and ip(x) = ub, we obtain 

/ p(LuB)dx = — j pb ■ nus d<TA{x) — / pn ■ oNub dovi(x) + / usn • div(ap) do~A(x), 
J A JdA JdA JdA 
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where n is the interior normal vector at dA. Apply (|3.ip again with D = 0, <j) = q and ijj = ub, 

/ pq(LuB)dx= / pb ■ hub d<JA{x) + / pn ■ clVub daA{x) — / Mgn ■ dxv(apq) da A (%)■ 
JO JdA JdA JdA 

Combining the two with ()4.2p . we get 

/ rj A ~(dx)uB(x) = — I pusn ■ aVqdcjAix) = — j pqdx. 
JdA v JdA v Jm d 

Similarly, defining ua{x) to be the mean first hitting time of Xt to A starting at x, we have 

/ rfe{dx)u A (x) = - [ p{l-q)dx. 
JdB v JR d 

Add the integrals together to obtain 

/ r]\(dx)u B {x) + / r]^{dx)u A {x) = -. 

JdA JdB v 

On the other hand, observe that 
— = lim — — 

1 

71=0 

. N-l ^ N-l 

= J™ n S «n ~ <n) + Jjm - £ (r+„ +1 - r+J. 

n=0 n=0 



As iV — > oo, we have 



1 7V_1 , /■ 

Tab = lim — ^(r^„ - rj ) = E[t b \ X ~ ^] = / f?^(dx)u B (x), 



n=0 

and similarly 



1 , /■ 

T BA = lim — ( T A,n+l ~ t bJ = / Vb ( dx ) v •>(•'• ' 

Therefore 

— = / r/^(dx)ti B (x) + / r/+(dx)n A (x) = -, 
^-R JdA JdB v 



or equivalently u = vr. 

From Theorem 11.71 it follows immediately that 



Cab = I rj A {dx)v B {x). 
JdA 

Indeed, the functional F : Y — > is in L X (X, B, Q^-) by Proposition E2J The function 
vb(x) = E[r^ | Yq = x] satisfies 

lh b = -l, x g e 

with t>(x) = for x G (95. Hence, the function w(x) = q(x)vB(x) satisfies Lw = — g for x G 
with boundary condition u;(x) = for x G <96. Moreover, for xo G cM, we have 

u)(x) n(x ) • a(x )Vw(x ) 

v B (x ) = lim — - = — . 

i->io n(xo) • a(xoJ V(7(xo) 
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Therefore, 

r/2(<ix)vB(x) = / p(x)n(x) ■ a(x)Vw(x) dcr^x). 

dA v JdA 



Now applying (|3.ip with D = 0, 4> = Q and tp = w, we have 

I p(x)n(x) ■ a(x)Vw(x) <1(ta(x) = — [ p(x)q(x)q(x) dx. 

v JdA v Je 

It remains to show that 

v = / pVq ■ aVq dx. 

jR d 

Using integration by parts, we have 

/ pVq-aVqdx= I pV [q — — ) • aVqdx 

= - / V • {paVq) (q--)dx+ [ p(q - -)n ■ aVq da A (x) 
Je z JdA 1 

+ / p{q~\)n- aVqda B (x). 
JdB 1 

The first term on the right hand side vanishes as 

f V • (paVq) — -) dx = f (ptraV 2 q + pb-Vq\(q—-)dx 
Je 2 J @ \ J 2 



+ \ j_ (div(pa) • V - pbV^j (q 2 - q) dx 

J P(Lq){q ~ 5) dx - i f e (L*p)(q 2 ~ q) = 0, 



where we have used that q 2 — q = on dA U dB. The conclusion then follows from Lemma I1.3} 
q = on dA, and g = 1 on <9i?. □ 

4.2. Density of transition paths. We define the Green's function G& of the operator L in 
with Dirichlet boundary condition on dQ: 



(4.3) 



LG @ (x,y) = -5 y (x), x G G, 
G e (x,y) = 0, xeSe. 



The existence of the Green's function is guaranteed by the ergodicity of Xt in M. d , which implies 
that Xt is transient in (see e.g. |Pin951 Section 4.2]). 

Lemma 4.1. Let G@ be the Green's function of L in with Dirichlet boundary condition on 
d@. We have 

(4.4) G%(x,y)= rQ R (t,x,y)dt= qiy)Geix > y) 



q(x) 



In particular, for x £ dA, y G 
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Proof. Fix y G O. For x G O, (|4.4p follows from [Pin95, Proposition 4.2.2]. Specifically, the 
function G|,(x,y) defined by 

G q e (x,y) = / Q R (t,x,y)dt 



is related to the Green's function (|4.3j) by the formula 

CL(*,v) = q(v)G ? { ™\ .,,6 8. 



Because of the regularity of the coefficients a(x) and 6(aj), Schauder-type interior and boundary 
estimates imply that G(-,y) G C 2 ' Q (0 \ {y}). Since G{x,y) = q(x) = for x G dA, the Hopf 
Lemma implies that for all x G dA, V x G(x,y) is a nonzero multiple of n(x). That is, for all 
x G cL4, V x G(x,y) = r(x)n(x) for some continuous r(x) < 0. The same is true for q. Therefore, 
is continuous in x up to the boundary d® and for xq G dA, 



lim Lr e (X,y) — — r — — - , 

x-*xo,xe& n(xo) ■ a{Xo)\/ q{Xo) 



It remains to show that for xq G dA, 

( 4 -6) ^r—, , r= ; =/ Q R (t,x ,y)dt. 

n(x ) ■ a{x )\/q(x ) J 

Let tp > be smooth and compactly supported in 0. By Proposition 12.61 we have 

lim %p{Y t ) \Y = x) = E[<p(Xt) I *o = x Q ]. 

X^Xq 

Moreover, 

E[ip(Y t ) \Y = x}< \\ip\\ooQ(Y t G e | y = x). 
By Proposition 12.71 for any R > 0, there are constants k\,fv2 > such that Q(Y t G G | Yq = 
x) < he~ h2t for all x G 9, \x\ < R, t > 0. Therefore, we have E[<p(Y t ) \Y = x] < |M|oo&ie~ fc2i 
so the dominated convergence theorem implies that 

r poo 

lim / G g e (x,y)ip(y)dy= lim / E[<p(Y t ) \ Y = x] dt 

poo 

(4.7) = / E[tp(Y t ) I Y = x ] dt 



Q(t,x ,y)(p(y)dy dt. 



(-> 



On the other hand, we also have 

(4.8) lim [ G%( X ,y) v ( v)A y = ( fr. 



x->x J e w J e n(x ) ■ a(x )Vq(x ) 

Therefore, by combining (|4.7p and (|4.8p we conclude 
q{y)n(x Q ) ■ a(x )V x G e {xo,y) 



. , _ , . <p(y)dy= / / Q(t,x ,y)(p(y)dydt 
e n(x ) ■ a(xo)\/q(x ) 



o Je 

oo 



Q(t,x ,y)dt) (f(y)dy. 

Since (p is arbitrary, this implies ()4.6p . □ 



e 
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Proof of Proposition \1.9l Using Lemma 14.11 and (jl,35p , 

(4.9) Pr(z) = » r [ n A (dx)G q e (x,z). 

JdA 

Recall the explicit formula of rf^ in terms of q (jl.20p . we obtain for 260 

Pr(z) = ~ [ p(x) 9 ^ n j\ \ aV _" Gei f' Z ^ n{x) ■ aVq{x)da A (x) 
JdA n (x) ■ aVq(x) 



= ~q{y) / p(x)n(x) ■ aV x G e (x, z) da A (x). 
JdA 

Apply (|3.ip by taking -ip(x) = Gq(x,u) and 4>(x) = q(x), we conclude that 

PR{V) = ~q(y) / p{x)<p{x)n(x) ■ aVip(x)da e (x) 
J 80 



= -q(y) / p{x)4>{x)Lij}(x) 
Je 

= p(y)q(y)q(y)- 

Here to get the second equality, we have used that Lq = in and ip{x) = on dQ. □ 
4.3. Current of transition paths. 

Proof of Proposition \1.1(A It follows from a direct calculation from the definition of Jr as f|1.38jl , 
noticing that q = 0, q = 1 on dA, and q = 1, q = on dB. □ 

Proof of Corollary By Proposition 11.101 we have 



vr = - / n(x) ■ Jr(x) da A {x). 

IdA 



Hence, it suffices to show that 



n{x) ■ Jr(x) da A {x) + / n(x) ■ Jr{x) da s (x) = 0, 
OA JdS 

which follows from the fact that Jr is divergence free in (see (|1.37p ). □ 
Proof of Proposition \1.VA Using Proposition II . 101 for the left hand side of (|1.42|) , we obtain 

/ f(x)VB(dx) - [ f(x)rjj(dx) = —[ fn-J R da B + —[ fn-J R da A , 

JOB ' JdA Vr J dB Vr Jg A 

where n is the unit normal exterior to 0. Equation (jl.42p then follows from the divergence 
theorem. 

Now fix any g G C 1 (dB), we extend g to using the flow (jl.40n : for any x S 0, we define 
(4.10) g(x)=g(Zf B ), with Z% = x. 

In particular, for x 6 dA, we have g(x) = g($j R (x)), in other words, 

(4.H) g\ 9 A = $j R (g\dB)- 

By the construction (|4.10p . for any x G 0, Jr ■ Vg = 0. Combining with the first part of the 
Proposition and (14. lip , we obtain 

g(x)rj+(dx)= gr]j(dx). 
dB JdA 
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Therefore, <S>j R ,*{v A ) = Vb- 



□ 
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